NSF PAR Search | NSF Public Access Repository


Assume-Guarantee Reinforcement Learning

Kazemi, M.; Perez, M.; Somenzi, F.; Soudjani, S.; Trivedi, A.; Velasquez, A. (February 2024, The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24))

We present a modular approach to reinforcement learning (RL) in environments consisting of simpler components evolving in parallel. A monolithic view of such modular environments may be prohibitively large to learn, or may require unrealizable communication between the components in the form of a centralized controller. Our proposed approach is based on the assume-guarantee paradigm where the optimal control for the individual components is synthesized in isolation by making assumptions about the behaviors of neighboring components, and providing guarantees about their own behavior. We express these assume-guarantee contracts as regular languages and provide automatic translations to scalar rewards to be used in RL. By combining local probabilities of satisfaction for each component, we provide a lower bound on the probability of satisfaction of the complete system. By solving a Markov game for each component, RL can produce a controller for each component that maximizes this lower bound. The controller utilizes the information it receives through communication, observations, and any knowledge of a coarse model of other agents. We experimentally demonstrate the efficiency of the proposed approach on a variety of case studies.
Full Text Available
LTL-Based Non-Markovian Inverse Reinforcement Learning

Afzal, M.; Gambhir, S.; Gupta, A.; Krishna, S.; Trivedi, A.; Velasquez, A. (May 2023, Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023))

The successes of reinforcement learning in recent years are underpinned by the characterization of suitable reward functions. However, in settings where such rewards are non-intuitive, difficult to define, or otherwise error-prone in their definition, it is useful to instead learn the reward signal from expert demonstrations. This is the crux of inverse reinforcement learning (IRL). While eliciting learning requirements in the form of scalar reward signals has been shown to be effective, such representations lack explainability and lead to opaque learning. We aim to mitigate this situation by presenting a novel IRL method for eliciting declarative learning requirements in the form of a popular formal logic, Linear Temporal Logic (LTL), from a set of traces given by the expert policy.
Full Text Available
The Octatope Abstract Domain for Verification of Neural Networks

https://doi.org/10.1007/978-3-031-27481-7_26

Bak, S.; Dohmen, T.; Subramani, K.; Trivedi, A.; Velasquez, A.; Wojciechowski, P. (March 2023, Formal Methods. FM 2023.)
Chechik, M.; Katoen, JP.; Leucker, M. (Ed.)
Efficient verification algorithms for neural networks often depend on various abstract domains such as intervals, zonotopes, and linear star sets. The choice of the abstract domain presents an expressiveness vs. scalability trade-off: simpler domains are less precise but yield faster algorithms. This paper investigates the octatope abstract domain in the context of neural net verification. Octatopes are affine transformations of n-dimensional octagons, that is, sets defined by unit-two-variable-per-inequality (UTVPI) constraints. Octatopes generalize the idea of zonotopes, which can be viewed as affine transformations of a box. On the other hand, octatopes can be considered a restriction of linear star sets, which are affine transformations of arbitrary H-Polytopes. This distinction places octatopes firmly between zonotopes and star sets in their expressive power, but what about the efficiency of decision procedures? An important analysis problem for neural networks is the exact range computation problem, which asks to compute the exact set of possible outputs given a set of possible inputs. For this, three computational procedures are needed: 1) optimization of a linear cost function; 2) affine mapping; and 3) over-approximating the intersection with a half-space. While zonotopes admit efficient solutions for these procedures, star sets solve them via linear programming. We show that these operations are faster for octatopes than for the more expressive linear star sets. For octatopes, we reduce these problems to min-cost flow problems, which can be solved in strongly polynomial time using the Out-of-Kilter algorithm. Evaluating exact range computation on several ACAS Xu neural network benchmarks, we find that octatopes show promise as a practical abstract domain for neural network verification.
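As an illustration of procedure 1) for the simplest domain named above (this is not the octatope method of the paper): a zonotope can be written as a center plus generators scaled by coefficients in [-1, 1], and the maximum of a linear cost over it then has a closed form, so no linear program is needed. The function name `zonotope_max` and the list-of-generators representation are assumptions made for this sketch.

```python
from typing import Sequence


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def zonotope_max(
    c: Sequence[float],
    center: Sequence[float],
    generators: Sequence[Sequence[float]],
) -> float:
    """Maximum of c.x over {center + sum_i e_i * g_i : each e_i in [-1, 1]}.

    Each coefficient e_i can be chosen independently, so the best choice
    is e_i = sign(c.g_i), which contributes |c.g_i| to the objective.
    """
    return dot(c, center) + sum(abs(dot(c, g)) for g in generators)


# A box is the special case where the generators are axis-aligned.
unit_box = [(1, 0), (0, 1)]
print(zonotope_max((1, 2), (0, 0), unit_box))

# A square rotated by 45 degrees, with generators (1, 1) and (1, -1).
rotated = [(1, 1), (1, -1)]
print(zonotope_max((1, 0), (0, 0), rotated))
```

The cost is one dot product per generator, which is the sense in which zonotopes handle this procedure efficiently.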
Full Text Available
Robust Average-Reward Markov Decision Processes

Wang, Y.; Velasquez, A.; Atia, G.; Prater-Bennette, A.; Zou, S. (January 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Full Text Available
Model-Free Robust Average-Reward Reinforcement Learning

Wang, Y.; Velasquez, A.; Atia, G.; Prater-Bennette, A.; Zou, S. (January 2023, International Conference on Machine Learning)

Full Text Available
Active Grammatical Inference for Non-Markovian Planning

Topper, N.; Atia, G.; Trivedi, A.; Velasquez, A. (January 2022, International Conference on Automated Planning and Scheduling)

Full Text Available
Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning

Dohmen, T.; Topper, N.; Atia, G.; Beckus, A.; Trivedi, A.; Velasquez, A. (January 2022, International Conference on Automated Planning and Scheduling)

Full Text Available
Multi-Agent Tree Search with Dynamic Reward Shaping

Velasquez, A.; Bissey, B.; Barak, L.; Melcer, D.; Beckus, A.; Alkhouri, I.; Atia, G. (January 2022, International Conference on Automated Planning and Scheduling)

Full Text Available
Elliptic flow of charged particles at midrapidity relative to the spectator plane in Pb–Pb and Xe–Xe collisions

https://doi.org/10.1016/j.physletb.2022.137453

Acharya, S.; Adamová, D.; Adler, A.; Aglieri Rinella, G.; Agnello, M.; Agrawal, N.; Ahammed, Z.; Ahmad, S.; Ahn, S.U.; Ahuja, I.; et al (November 2023, Physics Letters B)

Full Text Available
f0(980) production in inelastic pp collisions at $\sqrt{s}$ = 5.02 TeV

https://doi.org/10.1016/j.physletb.2022.137644

Acharya, S.; Adamová, D.; Adler, A.; Aglieri Rinella, G.; Agnello, M.; Agrawal, N.; Ahammed, Z.; Ahmad, S.; Ahn, S.U.; Ahuja, I.; et al (November 2023, Physics Letters B)

Full Text Available
